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Abstract 

We give an algebraic, determinant-based algorithm for the Cycle problem, i.e., the 
problem of finding a cycle through a set of specified elements. Our approach gives a simple 
FPT algorithm for the problem, matching the C'*(2l^l) running time of the algorithm of 
Bjorklund et al. (SODA, 2012). Furthermore, our approach is open for treatment by classical 
algebraic tools (e.g., Gaussian elimination), and we show that it leads to a polynomial 
compression of the problem, i.e., a polynomial-time reduction of the if- Cycle problem 
into an algebraic problem with coding size Odifp). This is surprising, as several related 
problems (e.g., /c-Cycle and the Disjoint Paths problem) are known not to admit such 
a reduction unless the polynomial hierarchy collapses. Furthermore, despite the result, we 
are not aware of any witness for the Cycle problem of size polynomial in |if | -I- logn, 
which seems (for now) to separate the notions of polynomial compression and polynomial 
kernelization (as a polynomial kernelization for a problem in NP necessarily implies a small 
witness). 

1 Introduction 

Parameterized complexity [19^ [21] is one of the major approaches for dealing with NP-hard 
problems. In this setting, the input is associated with a parameter k, usually (but not ex- 
clusively) either a parameter related to the solution size, or a structural parameter such as 
treewidth; the fundamental assumption is that problems with a smaller parameter value will 
be easier than general instances. The critical notion is that of an FPT algorithm, which runs 
in time f{k) ■ poly(n) for some f{k) where poly(n) is independent of k, i.e., the combinatorial 
explosion is confined to the parameter k. This notion has lead to a large number of interesting 
algorithmic principles; for some surveys, see, e.g., the Festschrift of Mike Fellows [7]. 

One of the most vibrant parts of parameterized complexity in recent years is the subfield 
of kernelization. A kernelization is one of the basic approaches for creating FPT algorithms: 
It is an algorithm which runs in time polynomial in both k and n, which reduces the size 
of the instance (e.g., via reduction rules such as "remove a vertex shown not to be required 
by the solution") if the size is larger than some f{k). Additionally, beyond being a design 
paradigm for FPT algorithms, it has been observed that the notion can be a good way to 
formalize effective instance simplification, e.g., preprocessing with a performance guarantee. A 
polynomial kernel, then, is a polynomial-time procedure which takes an input instance, with 
parameter k, and produces an output instance of size at most poly(A;), regardless of the value 
of n, without changing the problem status. Great interest has been taken in recent years 
in the question of which problems (and which problem parameterizations) admit polynomial 
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kernels. This was sparked by the creation of a lower bounds framework by Bodlaender et al. [5] 
and Fortnow and Santhanam [23]. These results provided a way to exclude the existence of a 
polynomial kernel, under the hypothesis that the polynomial hierarchy does not collapse. Later 
refinements and applications of this framework can be found in, e.g., [TTl [ini [Ml [251 EOl flHl [T^. 
Significant progress has also been made on the positive side; for a few examples, see [9l[l0l[22]. A 
recent trend, relevant to the current paper, is the application of algebraic tools to kernelization, 
e.g., [29l[30]. (See related work, below.) 

Sometimes, the results found by these investigations can be quite surprising. As an example, 
consider the problems Vertex Cover (find a set of at most k vertices in a graph which covers 
all edg vertex cover of size at most k) and Connected Vertex Cover (find a 

vertex cover of size at most k which additionally is connected). The former is one of the most 
well-studied problems in theoretical computer science. In terms of parameterized complexity, 
it can be solved in time 0*{2^) by a very simple algorithm, and in time C'*(1.28'^) by more 
involved means [T2]. It has a simple 2-approximation, and a kernel of 2k vertices by the famous 
Nemhauser- Trotter theorem [36]. On the other hand, if the vertex cover is required to be 
connected, then the problem still has a simple greedy 2-approximation, an 0*(2'^)-time FPT 
algorithm [15], and, as shown by Dom et al. [18j, no polynomial kernel unless the polynomial 
hierarchy collapses. 

As another example, consider the following three problems. Given a graph G, find (a) a cycle 
with at least k vertices (the A:-Cycle problem); (b) a cycle passing through every element of a 
given set -ftT, |i^| = k (the IT-Cycle problem); or (c) a cycle passing through every element of K, 
which furthermore passes the elements in a specified order (which we may dub the Ordered 
IT-Cycle problem). Which of these seem more or less general? Which, if any, seems most 
likely do admit efficient instance simplification? 

Let us make a quick review of known FPT and kernelization results for these problems. 
All are NP-hard; in the first two problems, setting k = n yields the Hamiltonian Cycle 
problem. The /c-Cycle problem is closely related to /c-Path (the problem of finding a path 
of length at least k), and there is by now a variety of interesting techniques that can be used 
to solve it in 2^('^)poly(n) time, from the seminal color-coding technique of Alon et al. [2], via 
the multilinear detection of Koutis |28j (see also Williams [13]), to the recent 0*(1.66")-time 
Hamiltonian Cycle algorithm of Bjorklund [3], which was adapted to a parameterized setting 
in [5]. Ordered K-Cycle is equivalent to the well-known problem Disjoint Paths, where 
the input is k pairs of vertices (sj, ti), and the question is if we can connect all pairs with pairwise 
vertex-disjoint paths. This problem seems much more challenging. Robertson and Seymour, in 
the context of the graph minors programme, showed that it is FPT; more specifically, that it can 
be solved in time ©(n^) for every fixed k [37j. Kawarabayashi et al. improved this to 0{n^) for 
every fixed k [27]. However, the algorithms are in both cases very involved, and the dependency 
of the running time on k is hard to pin down exactly, but at the very least multiply exponential. 
As for polynomial kernelization, both are infeasible: /c-Path and A;-Cycle were among the 
first problems to which the lower bounds framework was applied, and Disjoint Paths was 
addressed in [11]. In both cases, the conclusion is that neither problem allows a polynomial 
kernelization (or even a polynomial-time compression into size poly (A:)) unless the polynomial 
hierarchy collapses. 

The ivT-CYCLE problem, in turn, may intuitively seem to be closer in nature to the latter 
problem than the former - e.g., it is a terminal connectivity problem, parameterized by the 
number of terminals, and there is no obvious relation between the parameter and the size 
of the solution. Indeed, the problem can be solved via applications of the Disjoint Paths 

algorithm, and Kawarabayashi solved the problem in time 2 poly(n) using graph minors- 
type graph structural reasoning However, recently, Bjorklund et al. ^ solved iC-CYCLE 
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using an approach much closer to those of the cited /c-Path algorithms: they define a large 
polynomial, which can be evaluated in 2*^ • poly(n) time, and which, when evaluated over a field 
of characteristic two, is non-zero if and only if the instance is positive. The result then follows 
from an application of the Schwartz-Zippel lemma. (In fact, they solved the more general variant 
of finding a shortest K-cycle.) For kernelization, the status of i^- Cycle is so far unknown, but 
there are several factors - the lack of a small witness, the status of the related problems given 
above, the apparent difficulty of the problem - which would suggest that the answer should be 
negative (i.e., that i^-CvCLE should have no polynomial kernel). As the present paper shows, 
this conclusion may well be mistaken. 

Our results. We give an alternative algebraic algorithm for the X-cycle problem, also 
with a running time of 0*{2'-'^''^), by encoding the problem into a variant of the Tutte ma- 
trix. More concretely, given G and K we construct a matrix Mq over GF(2^), whose entries 
are polynomials, and show that G has a X-cycle if and only if the determinant polynomial 
of Mg contains a certain type of term. Further minor modifications of the matrix yield an algo- 
rithm with running time 0*{2^), and a matrix structure such that careful application of partial 
random evaluation and Gaussian elimination can reduce Mq to a matrix A with total coding 
length 0{k^), such that it can be decided from the determinant polynomial of A whether G 
has a i^-cycle. All in all, this yields a randomized polynomial compression of X-Cycle into 
space 0{k^). The construction, and all proofs, are simple, and we need only basic arguments 
about determinants and cycle covers to complete them. 

We note that our approach so far fails to provide a polynomial kernel, in the strict sense; 
the reason being that the output is an instance of a different problem (of deciding a particular 
property of det^) which is not known to be in NP, while a kernelization requires that the 
output is an instance of the same problem. This is closely related to the issue of the witness 
size required for X-Cycle; we are not aware of a witness for either X-Cycle or for our artificial 
algebraic output problem, of size poly(A; + logn). We consider these results quite surprising. 

Related work. The Tutte matrix (see Section [2D is a skew-symmetric matrix of indetermi- 
nates, created from the adjacency matrix of a graph G, which is non-singular if and only if G has 
a perfect matching [51]. This can be used to determine the size of a maximum matching in ran- 
domized time ©(n'^) [32l[3l|, where u < 2.3727 is the matrix multiplication exponent [421I391 [T3]. 
Mucha and Sankowski [34] showed how to find a maximum matching in the same time. Gee- 
len [25] gave a deterministic polynomial-time procedure which finds a maximum rank evaluation 
of the Tutte matrix (which does not lead to a competitive deterministic matching algorithm, 
but may be of interest for the general question of removing randomness due to applications of 
Schwartz-Zippel). For non-algebraic algorithms for matching, Micali and Vazirani [33] gave an 
algorithm that finds a maximum matching in general graphs in time 0{my/n). 

Algebraic FPT algorithms, beyond those cited for fc-PATH above, have been used by, e.g., 
Lokshtanov and Nederlof [31J and Cygan et al. [H]. See also Nederlof's PhD thesis [35]. More 
specifically, algorithms based around determinant computations have been used by Bjorklund [4] 
[3]. However, we argue that the approach of the present paper leads to significantly simpler 
algorithms and correctness proofs than before. Algebraically based kernelizations, in particular 
using tools of matroid theory, have been given in [29( l30j . Related to the present work, it 
is interesting to note that the result of [29] was a pure compression, albeit within NP (the 
problem Odd Cycle Transversal was encoded into matroid, represented by a matrix of total 
coding length poly(A;)), while [30] gave graph-based reduction rules, significantly broadening the 
applicability of the tools. A similar improvement on the tools of the present work would be 
highly interesting. 
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Organization. We review some basic definitions in the next section, then Section [3 gives, 
in turn, a very simple 0*(4'^)-time for IT-Cycle; an improvement to an 0*(2'^)-time algorithm; 
and the Gaussian polynomial compression. 



2 Preliminaries 



Parameterized complexity. A parameterized problem is a language Q C S* x N; the 
second component of instances (x, k) is called the parameter (cf. [H]). A parameterized problem 
is fixed-parameter tractable (FPT) if there is an algorithm A and a computable function / : N — )■ 
N such that A decides {x, A;) G Q in time f{k)\x\^^-^\ A kernelization of Q is a polynomial-time 
computable mapping K: S* x N — )• S* x N: {x,k) ^ {x',k') such that {x,k) G Q if and only 
if {x',k') £ Q and with |rE'|,A;' < h{k) where /i is a computable function; h is called the size of 
the kernel and X is a polynomial kernelization if h{k) is polynomially bounded. A polynomial 
compression is a polynomial kernelization relaxed so that the output may be an instance of a 
(fixed) different language than the input language. This has also been called bikernel [Ij and 
generalized kernelization [8]). 

The Tutte matrix. Let G = (V, E) be a simple undirected graph with V = {ui, . . . , 
The Tutte matrix Aq is the n x n matrix of indeterminates such that 



Xij if ViVj G E and i < j, 
—Xji if ViVj G E and i > j, 
otherwise. 



where Xij are distinct commuting variables. Tutte |41j showed that det Aq / (viewed as a 
polynomial) if and only if G has a perfect matching. Lovasz |32] showed the applications of this 
type of result to randomized algorithms. 



Determinants and cycle covers. We recall a few basic facts. Let D = {V, E) be a 
directed graph, which may contain loops. A cycle cover of D is a set C E of arcs such that 
every vertex in D has in- and out-degree exactly one in C. We allow loops to be present in 
the cycle cover. For an undirected simple graph G, which again may contain loops, an oriented 
cycle cover of G is a cycle cover of the bidirectional graph corresponding to G. (Note that this 
implies that loops and isolated edges are permitted in the cycle cover, corresponding to cycles 
of length one respectively two.) 

Let A be an n X n matrix over a field of characteristic two. Then the determinant and the 
permanent of A coincide: 

n 

det^ = perm(A) = ^ JJ^(i,a(i)), (1) 

where Sn is the set of all permutations of [n]. Let D be a directed graph on vertex set V = 
{vi, . . . ,Vn} such that ViVj G E{D) if and only if A{i,j) ^ (where ViVi denotes a loop on the 
vertex Vi). There is a well-known bijection between terms of the naive summation ([T]) of the 
determinant and cycle covers of D, as follows. 

Proposition 1. Let D = {V,E) and A be as above. For a permutation a G Sn, let = 
{'^iVa(i) '■ i £ Ml- If C E, then is a cycle cover of D; furthermore, this describes a 
bijection between cycle covers of D and non-zero terms in the summation ^\). 
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Let C be a simple cycle in a cycle cover €„. We call C reversible if the cycle has length at 
least three and for every edge ViVj G C, we have A{i,j) = A{j,i)] further, we call reversible 
if it contains at least one reversible cycle. A critical observation, both in previous and present 
work, is that reversible cycle covers cancel in ([1]). 

Proposition 2. // ^^ is computed over afield of characteristic two, then the terms correspond- 
ing to reversible cycle covers cancel each other. 

Proof. Let be a reversible cycle cover, and let C be the first reversible cycle of C^, counted 
by vertex incidence (i.e., the cycles of are sorted according to the number of the earliest 
incident vertex). Let C = C^i be the cycle cover resulting from reversing C. Then this operation 
creates a fix-point-free involution among the reversible cycle covers. Further, as the terms of ([1]) 
corresponding to a and a' are identical by definition, all terms of ([TJ corresponding to reversible 
cycle covers will cancel each other out. □ 

Thus, when reasoning about the surviving terms of det^, we only need to concern ourselves 
with non-reversible cycle covers. (In particular, \i A = Aq is the Tutte matrix of a graph G 
over a field of characteristic two, then every cycle of length more than two is reversible, and 
there are no cycles of length one; thus the non-reversible cycle covers are exactly the perfect 
matchings of G.) 

Schwartz-Zippel. We also recall the Schwartz-Zippel lemma. 

Lemma 1 (Schwartz-Zippel [Ml El)- Let P{ xi, . . . ,Xn) be a multivariate polynomial of total 
degree at most d over a field F, and assume that P is not identically zero. Pick ri,...,r„ 
uniformly at random from ¥. Then Pr(i-*(ri, . . . ,r„) = 0) < d/\F\. 

We will use this mostly, though not exclusively, for the case that P is the determinant of a 
matrix over GF(2^). 

Detecting monomials in a polynomial. Our final generic ingredient is an application 
of inclusion-exclusion to finding certain monomials in a polynomial over a field of characteristic 
two. For a polynomial P and a monomial m, we let P{m) denote the coefficient of m in P. We 
need a way to extract from P only those monomials divided by a certain term. 

Lemma 2. Let P{xi, . . . ,Xn) be a polynomial over a field of characteristic two, and T C [n] 
a set of target indices. For a set I CI [n], define P_j-(xi, . . . , x„) = P{yi, . . . , yn) where yi = 
for i G I and yi = Xi otherwise. Define 

Q{xi, ...,Xn) = '^ P-l{xi, . . . ,Xn). 
ICT 

Then for any monomial m such that t := IlieT-^* divides m we have Q{m) = P{m), and for 
every other monomial we have Q{m) = 0. 

Proof. Consider a monomial m with non-zero coefficient in P. Observe first that for every / C 
[n], we have P^j{m) = P[m) if no variable Xi with i £ I occurs in m, and P^i{m) = 
otherwise. Now, if t divides m, then out of the 21^' evaluations, the monomial m occurs in 
exactly one (namely, / = 0). Thus, Q{m) = P{m). If t does not divide m, let J = {i G I : 
Xi does not divide m}, and observe that P^j{m) = P{m) for every I O J. Since J / 0, this is 
an even number of occurrences of the same monomial with the same coefficient, which implies 
that they sum to zero. Applying this argument individually to every monomial in P accounts 
for all occurrences of monomials in the sum defining Q; the result follows. □ 
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We remark that we do not require P to be multilinear (although we do require T to be a 
set rather than a multiset). 

3 An Algebraic FPT Algorithm 

We now give our algorithm and compression for iC-CYCLE. Let us first fix a definition. 

Definition 1. For a vertex v eV, a v-cycle is a cycle that passes through v. For a set T C.V, 
a T-cycle is a cycle that passes through all vertices of T. In both cases, the cycle may pass 
through further vertices, but this is not required. 

The problem is then formally defined as follows. 

iC-CYCLE 

Input: A graph G = {V, E); a set K C y of terminal vertices 

Parameter: k := \K\ 

Question: Is there a i^-cycle in G7 

We will show two algebraic FPT algorithms for this problem, giving two ways of encoding 
it into the determinant of a matrix. We then show how this implies a polynomial compression 
via Gaussian elimination, into space 0{k^). 

3.1 Graph preprocessing 

We begin with a simple preprocessing of the graph (reducing the terminals to degree two). 

Lemma 3. Let {G,K) be an instance of X-Cycle with \K\ > 1. We can reduce {G,K) to an 
equivalent instance {G',K'), \K'\ = \K\, where d{v) = 2 for every v £ K' , and where K' is an 
independent set with no common neighbours. 

Proof. We assume that K is an independent set in G (by subdividing edges within K, if nec- 
essary). Construct G' from G by replacing every terminal v £ K hy two non-adjacent copies 
v', v" (with neighbourhoods identical to that of v). Create a new vertex v with N{v) = {v', v"}. 
The new terminal set K' consists of these new vertices v. 

It is easy to show that this reduction maintains the solution status. On the one hand, for 
any if-cycle in G, we may replace each portion u — v — w oi the cycle, with v E K and hence 
u,w^K,hya path u — v' — v ~ v" — w, hitting the new terminal v. On the other hand, any 
X'-cycle in G' must pass through both neighbours v', v" of each terminal v £ K', and these 
neighbours are distinct for all terminals. Thus if each segment v' — v — v" of the i^T'-cycle in G' 
is contracted into v, we get a valid K-cycle in G. □ 

The requirement that |ivr| > 1 comes from the consideration of whether a single edge uv 
should be considered a K-cycle with K = {u} (but the case \K\ = 1 is in either case easily 
solvable in polynomial time). 

For the rest of the paper, for convenience, we will let G = {V, E) be a graph, on vertex set 
V = {vi, . . . , Vn} and terminal set K = i, . . . , Vk}, to which the above reduction has already 
been applied. We also assume N{vi) = {ffe+2i-i,i'fe+2i} for i £ [k\. 



6 



3.2 Matrix construction 

We now show the matrix which will encode the existence of a X-cycle. We begin with a 
more intuitive construction, that implies a running time of 0*{4'^), then modify it to arrive at 
the C'*(2^)-time algorithm and polynomial compression. 

Given a graph G, reduced as per the previous subsection, we define the matrix Aq as follows. 
We start from the Tutte matrix Aq of G (although, as the field is of characteristic two, we will 
not observe the signs), and adjust so that A{i,i) = 1 for i > 3A; (effectively adding self-loops 
to all vertices except N[K]). Finally, we orient the edges incident to vi to make fi -cycles non- 
reversible: let ^(1, A; + 1) and A{k + 2, 1) be unmodified, but set A{1, k + 2) = A{k + 1,1)= 0. 
This can be done safely, as any ii'-cycle of G can be oriented in either direction. 

Let Mg denote the resulting matrix. We can detect a X-cycle in G as follows. 

Theorem 1. Let T = {xi^k+2i-iTXi^k+2i '■ ^ £ [k]} and t = HzeT-^- 'Then G has a K-cycle if 
and only if detMc viewed as a polynomial, contains a monomial m with non-zero coefficient 
such that t divides m. 

Proof. Recall the summation ([1]) and the notion of a reversible cycle from Section [2j We claim 
a one-to-one correspondence between non-zero monomials of detMc and non-reversible cycle 
covers. 

This follows from basic observations, but we prove it for completeness. Since ([1]) is already in 
sum-product form, every non-zero monomial of det Mq corresponds to a non-empty set of sum- 
mands from ([1]). By Prop. [2l we get the same result if we restrict ourselves to those summands 
corresponding to non-reversible cycle covers. We show that two summands, corresponding to 
distinct non-reversible cycle covers C,C', always produce distinct monomials: if C and C use 
distinct sets of underlying undirected, non-loop edges, then the claim is clear, and the set of loop 
edges of a cycle cover is a function of the set of non-loop edges. In the remaining case, C' must 
be attainable by a reorientation of C. However, there are by construction only three types of 
non-reversible cycles: loops, isolated edges, and f i-cycles, where a t;i-cycle cannot be reversed, 
and loops and isolated cycles are invariant under reversal. Thus C and C must produce distinct 
monomials. 

The result is now simple. First, if C is a -fC-cycle in G, then it contributes all factors in t, 
and by padding G using self-loops we produce a non-reversible cycle cover, which produces a 
non-zero monomial of detM^r. On the other hand, if a non-zero monomial in detM^ contains 
the factor Xi^k+2i-iXi^k+2i, then in the corresponding cycle cover, the ui-cycle most also pass 
through Vi, as such a factor cannot be contributed by loops and isolated edges. By induction, 
if a non-zero monomial in det Mq is divided by t, then the corresponding cycle cover contains 
a vi-cycle which passes through every vertex of K, i.e., a X-cycle. □ 

As |r| = 2k, this implies an C'(2'^'^)-time randomized algorithm for the problem, via Lemma[2] 
and by evaluating the resulting polynomial Q randomly over GF(2^) for i = Q{logn). We will 
improve this in two ways: by introducing a modification which will let us match the 0*{2'') 
running time of 0, and by showing how to use Gaussian elimination and partial random 
evaluation to produce a polynomial compression. 

3.3 A 2^ Algorithm 

We now show a different way to determine the existence of a iiT-cycle from Mq. Let an orienta- 
tion of Mg be the result of, for every Vi € K , i > 1, either setting A{k-\-2i—l, i) = A[i, k+2i) = 
or A{k + 2i,i) = A{i,k + 2i — l) = 0, i.e., orienting the edges incident to Vi either as v[ — )• f j — )• v'l 
or as v[->^ Vi ^ v" . We claim the following. 
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Theorem 2. Let Q' be the sum of det Mq over all 2^ ^ orientations Mq of Mq- Then G has 
a K-cycle if and only if Q' is not identically zero. 

Proof. Let Mq be an arbitrary orientation of Mq ■ As in Theorem [U monomials of det Mq 
correspond to non-reversible cycle covers, but now, every cycle incident on some Vi G K counts 
as non-reversible (and again, attempting to reverse such a cycle produces a zero-term). On the 
other hand, if two orientations Mq and Mq contain non-reversible cycle covers C and C" such 
that C" can be obtained by reorienting C', then C' and C" contribute identical monomials to the 
sum, and their contribution may cancel. Thus, let C* be an unoriented cycle cover, such that 
every cycle in C* is either a loop, an isolated edge, or a cycle incident on K, and such that K 
is covered entirely by the latter type of cycles. We will count the number of contributions of 
orientations of C* to the sum. 

For this, simply observe that in a single cycle C of C*, as soon as the orientation of at least 
one vertex of C has been determined, the direction taken through every other vertex of C is 
fixed as a consequence. Thus, if C* contains a i^-cycle C, then only one orientation Mq is 
possible, as the vertex vi enforces a direction already in Mq. On the other hand, if C* contains 
at least two cycles incident on K, then all cycles not incident on vi may be oriented arbitrarily, 
making for an even number of orientations, each one of which contributes the same monomial 
to the sum. 

Thus non-zero monomials of Q' correspond to ET-cycles in G, as promised. □ 

This construction brings our algorithm closer in spirit to the determinant sums of Bjorklund [?l 
[3], or the algebraic FPT algorithms of Cygan et al. [15]. However, as the next subsection shows, 
by bringing the algorithm back into the structure of deciding properties of the determinant 
polynomial of a single matrix, we get a randomized polynomial compression for X-Cycle via 
Gaussian elimination. 

3.4 Polynomial Compression 

Now, we finally show how to use the above for a polynomial compression of the X-Cycle 
problem. 

We describe one final modification of the matrix Mq. For every -^j G iT, i > 1, we introduce 
a new variable Oj, and multiply A{k + 2i — \,i) and A{i, k + 2i) by Oj, and A{k + 2i,i) and A{i, k + 
2i — \) by 1 — Oj. Observe that this implies that the algorithm of Theorem [2] can be executed by 
iteratively setting each ai to either 1 or 0, and computing the determinant each time. Strictly 
speaking, each individual determinant computation would then seem to require a fresh dose 
of randomness, via the Schwartz-Zippel evaluation step, making the approach inappropriate 
for kernelization. We show that it is possible to perform this in the alternate direction, first 
randomly evaluating every variable Xe for e G E{G), then performing Gaussian elimination into 
a compressed output, and finally (at some future time) performing the 2^~'^ assignments to the 
variables Oj and computing the resulting determinants. 

Theorem 3. The i^-CYCLE problem has a randomized polynomial compression of size 0{k'^). 

Proof. We get the result in two steps, first showing that we can randomly evaluate the variables x 
while leaving a as indeterminates, then applying Gaussian elimination to produce a smaller 
matrix with the same determinant (viewed as a polynomial in a). 

Let -P(x, a) be the determinant polynomial of Mq. Define Q(x) to be the sum over the 2^~^ 
instantiations of a necessary to emulate the algorithm of Theorem [2j observe that (5(x) is a 
polynomial of degree n, and that Q{x) is identically zero if and only if G has no K-cyde. Thus, 
again by Schwartz-Zippel, we may instantiate x randomly from GF(2^), and with probability 
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at least the resulting values are such that the 2'^^^-sized evaluation of (5(x) would return 
non-zero. Picking I = 0(logn) is sufficient for this step to succeed with polynomial probability 
in n (and with £ = 0(logn + k), we get a failure rate still polynomial in n, but exponentially 
small in k). Note that by standard observations we may assume k > logn, as otherwise the 
0*(2'=)-algorithm runs in polynomial time. 

Now, observe that we only need to know the existence of the polynomial Q for the above cor- 
rectness argument. Thus, by replacing x randomly by values from GF(2^), we get a matrix M' 
with mostly concrete values, and indeterminates in the top-left 3A; x 3A; corner, such that pre- 
serving det M' is sufficient (up to the failure probability in the previous step) for preserving the 
information of whether G has a K-cjcXe. 

Next, recall that row and column operations preserve the determinant of a matrix exactly. 
We show that we can reduce M' to a blocks form 



where C is a matrix without indeterminates. Thus we will have detM' = (det yl) (det C) 
where det C is a constant. 

This is easy. For sets i?, C C [n], let M[R,C] denote the induced submatrix of M with 
rows R and columns C. Observe that the submatrix Mg'[[3A; + 1, n], [3A; + 1, n]] is non-singular, 
as the diagonal contributes the term 1 to the determinant and every other term will contain at 
least one indeterminate. Thus (up to the failure probability), M'[[3fc -t- 1, n], [3A: -|- 1, n]] is non- 
singular, and can be reduced to diagonal form with a non-zero diagonal, without introducing 
any new indeterminate entries in M' . Now we can use further row and column operations to 
reduce M'[[l,3fc], [3/c + l,n]] and M'[[3A; -|- l,n], [1,3/c]] to all-zero matrices (thereby modifying 
the contents of M'[[l,3/c], [1,3A:]], but not M'[[3fc + l,n], [3A: + l,n]]). This creates the desired 
blocks form, and every step preserves the determinant precisely and is performed without further 
failure probability or growth of the individual entries (since we are working over a finite field) . 

Finally, we consider the resulting contents of the matrix A. Initially, the entries of M'{i,j) 
for i,j < 3k are either constants, or expressions Oj -c + c' for some constants c,cf. Every further 
row or column operation that modifies these entries adds some concrete value c' to the entry, 
meaning that we can maintain these entries in the form a.j ■ c + c' where c, c' arc concrete vahics 
from GF(2^); thus the coding length remains 0{£) bits per entry. We then multiply one arbitrary 
row of A by detC, which again only has the effect of modifying the values c, c'. This gives us 
a 3A; X 3A; matrix A', with entries encoded into 0{i) = 0{k) bits, such that det^' = detM', 
where M' is the matrix produced by randomly instantiating x in Mq. □ 

Finally, we remark that, unusually, the output problem is not trivially in NP (as it is a 

question about the outcome of an exponentially large computation). Thus in terms of parame- 
terized complexity, we do not strictly speaking get a polynomial kernel, as we know of no way 
of getting back from the matrix A' above to an instance of iC-CYCLE. 

4 Conclusions 

We have shown an alternate algebraic algorithm for the X-Cycle problem, recasting the origi- 
nal problem into a question about the existence of certain terms in the determinant polynomial 
of a matrix with indeterminate entries. By careful application of partial evaluation and Gaussian 
elimination, we have shown that this leads to a polynomial compression of a X-Cycle instance 
into space 0(|i^p). This partially answers the question of the kernelizability of iT-CYCLE, in 
a perhaps surprising direction. 
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Although we are not able to produce a proper kernel, since we are not able to get back 
to an instance of the ivT- Cycle problem, such kernel- like polynomial compressions have been 
previously considered in parameterized complexity [IJ, and in fact all existing frameworks for 
excluding polynomial kernelization (e.g., [23,20]) also exclude polynomial compressions. Thus, 
for the sake of a smooth theory, we hope that the i^-CYCLE problem can also be shown to have 
a polynomial kernel (e.g., a compression within NP). 

Another interesting improvement would be a more direct kernel, e.g., based on reduction 
rules which make direct modifications to G and K. The tools required for finding such rules 
may well have further applications (perhaps analogously to the two previous works [291 130j). 

It would also be interesting to consider further related problems, perhaps starting with the 
problems of finding a shortest iC-cycle, and a -fC-cycle with a prescribed parity, as these problems 
can also be solved by the approach in [6]. While it seems that our algorithm can be adapted for 
this setting, it is not clear to us at the moment whether this can be done in a way that allows 
for a polynomial compression. 

Acknowledgements The author is grateful to Thore Husfeldt and Stefan Kratsch for 
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